Class-Guided Swin Transformer for Semantic Segmentation of Remote Sensing Imagery
نویسندگان
چکیده
Semantic segmentation of remote sensing images plays a crucial role in wide variety practical applications, including land cover mapping, environmental protection, and economic assessment. In the last decade, convolutional neural network (CNN) is mainstream deep learning-based method semantic segmentation. Compared with conventional methods, CNN-based methods learn features automatically, thereby achieving strong representation capability. However, local receptive field convolution operation limits from capturing long-range dependencies. contrast, Vision Transformer (ViT) demonstrates its great potential modeling dependencies obtains superior results Inspired by this, this letter, we propose class-guided Swin (CG-Swin) for images. Specifically, adopt Transformer-based encoder–decoder structure, which introduces backbone as encoder designs block to construct decoder. The experimental on ISPRS Vaihingen Potsdam datasets demonstrate significant breakthrough proposed over ten benchmarks, outperforming both advanced recent approaches.
منابع مشابه
EarthMapper: A Tool Box for the Semantic Segmentation of Remote Sensing Imagery
Deep learning continues to push state-of-the-art performance for the semantic segmentation of color (i.e., RGB) imagery; however, the lack of annotated data for many remote sensing sensors (i.e. hyperspectral imagery (HSI)) prevents researchers from taking advantage of this recent success. Since generating sensor specific datasets is time intensive and cost prohibitive, remote sensing researche...
متن کاملDeep Neural Networks for Semantic Segmentation of Multispectral Remote Sensing Imagery
A semantic segmentation algorithm must assign a label to every pixel in an image. Recently, semantic segmentation of RGB imagery has advanced significantly due to deep learning. Because creating datasets for semantic segmentation is laborious, these datasets tend to be significantly smaller than object recognition datasets. This makes it difficult to directly train a deep neural network for sem...
متن کاملLow-Shot Learning for the Semantic Segmentation of Remote Sensing Imagery
Recent advances in computer vision using deep learning with RGB imagery (e.g., object recognition and detection) have been made possible thanks to the development of large annotated RGB image datasets. In contrast, multispectral image (MSI) and hyperspectral image (HSI) datasets contain far fewer labeled images, in part due to the wide variety of sensors used. These annotations are especially l...
متن کاملSpatial Metadata for Remote Sensing Imagery
Mining the petabyte and growing archive of remotely sensed images to obtain the necessary information for land cover change studies becomes more difficult as more imagery is obtained and stored at various locations by government agencies or private companies. The increasing importance of networking with the requirement to move data sets between different servers and clients makes the data volum...
متن کاملA new process for the segmentation of high resolution remote sensing imagery
International Journal of Remote Sensing Publication details, including instructions for authors and subscription information: http://www.informaworld.com/smpp/title~content=t713722504 A new process for the segmentation of high resolution remote sensing imagery Z. Chen ab; Z. Zhao a; P. Gong b; B. Zeng c a Institute of Remote Sensing Applications, Chinese Academy of Science. Beijing 100101. Peop...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Geoscience and Remote Sensing Letters
سال: 2022
ISSN: ['1558-0571', '1545-598X']
DOI: https://doi.org/10.1109/lgrs.2022.3215200